Learning from Unscripted Deictic Gesture and Language for Human-Robot Interactions
نویسندگان
چکیده
As robots become more ubiquitous, it is increasingly important for untrained users to be able to interact with them intuitively. In this work, we investigate how people refer to objects in the world during relatively unstructured communication with robots. We collect a corpus of deictic interactions from users describing objects, which we use to train language and gesture models that allow our robot to determine what objects are being indicated. We introduce a temporal extension to stateof-the-art hierarchical matching pursuit features to support gesture understanding, and demonstrate that combining multiple communication modalities more effectively capture user intent than relying on a single type of input. Finally, we present initial interactions with a robot that uses the learned models to follow commands.
منابع مشابه
Recognition of Deictic Gestures with Context
Pointing at objects is a natural form of interaction between humans that is of particular importance in human-machine interfaces. Our goal is the recognition of such deictic gestures on our mobile robot in order to enable a natural way of interaction. The approach proposed analyzes image data from the robot’s camera to detect the gesturing hand. We perform deictic gesture recognition through ex...
متن کاملDesigning nonverbal communication for pedagogical agents: When less is more
This experimental study employed a 2 2 2 factorial design to investigate the effects of type of instruction (procedural module, attitudinal module), deictic gesture (presence, absence), and facial expression (presence, absence) on student perception of pedagogical agent persona, attitude toward the content, and learning. The interaction effect between type of instruction and agent nonverbal beh...
متن کاملOntogeny of Two Communicative Tools: Distance Encoding & Multimodality in Deictic Pointing
The aim of this paper is to experimentally study the development of speech/gesture relationship within language. The participants, both adults and children, had to designate a target placed at two different distances. They had to perform a deictic gesture accompanied by a deictic word, or to use either of the two modalities independently. Using one vs two pointing modalities allowed us to speci...
متن کاملInvestigating the Effects of Visual Saliency on Deictic Gesture Production from a Humanoid Robot
In many collocated human-robot interaction scenarios, robots are required to accurately and unambiguously indicate an object or point of interest in the environment. Realistic, cluttered environments containing many visually salient targets can present a challenge. In this paper, we describe an experiment and results detailing the effects of visual saliency and pointing modality on human percep...
متن کاملA Corpus of Natural Multimodal Spatial Scene Descriptions
We present a corpus of multimodal spatial descriptions, as commonly occurring in route giving tasks. Participants provided natural spatial scene descriptions with speech and abstract deictic/iconic hand gestures. The scenes were composed of simple geometric objects. While the language denotes object shape and visual properties (e.g., colour), the abstract deictic gestures “placed” objects in ge...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014